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Abstract 

7— I 1 

— — ' Under multiplicative drift and other regularity conditions, it is established that the asymp- 

totic variance associated with a particle filter approximation of the prediction filter is 
bounded uniformly in time, and the non-asymptotic, relative variance associated with a 
particle approximation of the normalizing constant is bounded linearly in time. The con- 
ditions are demonstrated to hold for some hidden Markov models on non-compact state 
spaces. The particle stability results are obtained by proving w-norm multiplicative stability 

i— i and exponential moment results for the underlying Feynman-Kac formulae. 

°: 

+j ', 1 Introduction 

ctf ■ 
-«-> ■ 

i ' Particle niters have become very popular devices for approximate solution of non-linear filtering 

' problems in hidden Markov models (HMM's) and various aspects of their theoretical properties 

' are now well understood. However, there are still very few results which establish some form of 

stability over time of particle filtering methods on non-compact spaces, at least without resorting 
to algorithmic modifications which involve a random computational expense. The aim of the 
present work is to establish theoretical guarantees about some stability properties of a standard 
particle filter, under assumptions which are verifiable for some HMM's with non-compact state 
, spaces. 

j> ! It is now well known that, under mild conditions, the error associated with particle approx- 

•l-H , 

imation of filtering distributions satisfies a central limit theorem. The first stability property 
we obtain is a time-uniform bound on the corresponding asymptotic variance. Making use of 
some recent results on functional expansions for particle approximation measures, the second 
stability property we obtain is a linear-in-time bound on the non-asymptotic, relative variance 
of the particle approximations of normalizing constants. These two properties are established 
by first proving some multiplicative stability and exponential moment results for the Feynman- 
Kac formulae underlying the particle filter. The adopted approach involves Lyapunov function, 
multiplicative stability ideas in a weighted oo-norm setting, which allows treatment of a non- 
compact state space. We thus obtain stability results which hold under weaker assumptions than 
those existing in the literature. The main restriction is that our assumptions are typically satis- 
fied under some constraints on the observation component of the HMM and/or the observation 
sequence driving the filter. On the other hand, subject to these constraints, our stability results 
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hold uniformly over observation records and without any stochasticity necessarily present in the 
observation process. 

The rest of this paper is structured as follows. Section [2] briefly introduces filtering in 
HMM's, particle filters, and comments on some existing stability results. Section [3] gives some 
applications of the main particle stability results to classes of hidden Markov models. The hope 
is that sections [2] and [3] can be read without the reader necessarily delving into the main results 
of section 0] or the corresponding proofs and auxiliary results of section [5j which are obtained in 
the more abstract setting of interacting particle approximations of Feynman-Kac formulae. 

2 Setting 

2.1 Hidden Markov models and filtering 

A hidden Markov model is a bi-variate, discrete time Markov chain ((X n , Y n ) ; n > 0) where the 
signal process (X n ) is also a Markov chain and each observation Y n is conditionally independent 
of the rest of the bi-variate process given X n . Each X n is valued in a state-space X and each 
Y n is valued in the observation space Y. The present work focuses on the case where X is 
non-compact, and we are typically interested in the case that X is some subset of M. d . In any 
case, throughout the following we assume that X and Y are Polish spaces endowed with their 
respective Borel u-algebras, B (X) and B (Y). Our main stability results, presented in section @J 
are in the setting of Feynman-Kac formulae which can be considered as underlying the filtering 
problem of interest. In that section, more precise definitions are given. In the present section, 
we consider the HMM directly. 

Let \i be a probability distribution on X, let / be a Markov kernel acting from X to itself and 
let g be a Markov kernel acting from X to Y, with g(x, •) admitting density, similarly denoted 
by g (x,y), with respect to some dominating cr-finite measure. We will assume that g (x,y) > 
and, for now, that sup E ^ g(x, y) < oo. Loosely speaking, the task of filtering is to compute some 
conditional distributions of the (X n ) process given the observations (Y n ), under an assumed 
model: 



For a realization of observations (jjq, y±, . . . , ), we may take as a recursive definition of the (one- 
step-ahead) prediction filters, the sequence of distributions (iT n ;n > 0) following 



(X ,Y ) ~ n (dx ) g (x ,dy ) , 

(X n ,Y n ) \ {X n _i = rc n _i} ~ / ( dx n ) g (x n , dy n ) 



n > 1 



(2.1) 



7T (dx ) := jU (dx ) 



Tr n (dx n ) : 



Jx 7T TO _i (cfa n _i) g (x n _i, y n -i) f(x n -i,dx n ) 
Ix^n-i (dx n -i)g (x n _i,y n _i) 



n > 1. 



(2.2) 



We also define the sequence (Z n ;n > 0) by 




n > 1. 



(2.3) 
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Note that the dependence of ir n and Z n on yo-.n— l = (2/0 > • • • Vn-l) is suppressed from the notation. 
Unless stated otherwise, whenever (ir n ) or (Z n ) appear below it should be understood that 
they depend on an arbitrary but fixed and deterministic Y-valued sequence (jjq, y\, . . .). The 
same applies for the particle approximations introduced in section 12.21 The set of observation 
sequences for which our particle variance results hold is made precise and discussed in section |3j 

Under the model (|2,ip . ir n is the conditional distribution of X n given {Yo :n _i = yo-.n— l}j 
and Z n is the joint density of io:n-l evaluated at g/o:n-l- The convention of working with the 
one-step-ahead quantities is mostly for simplicity of presentation in the following. 

In applications there typically will be some degree of model mis-specification; perhaps the 
data generating process (X n ,Y n ) is not distributed according to (|2.ip with this particular //, / 
and g, or perhaps (Y n ) are not the observations from an HMM at all (for ease of presentation we 
purposefully avoid giving a name to a "true" distribution for (Y n )). Never-the-less, as (yo, y%, . . .) 
arrive our aim is to compute, or well-approximate (n n ) and (Z n ) as per (|2.2p - (|2,3p with some /J,, 
f and g of our choosing. 

HMM's are simple and yet flexible models which have found countless applications. However, 
under choices of /i, / and g which are desirable in many practical situations, (7r n ) and (Z n ) are 
not available in closed form. 



2.2 Particle filtering 



Particle filters 



Gordon et al 



Il993l | are a class of stochastic algorithms which yield approxi- 
mations of (ir n ) and (Z n ) using a population of N samples which interact over time. These 
approximations will be denoted by (tt^) and (Z^) ■ Algorithm [1] is perhaps the most simple 
generic particle filtering scheme (a more precise probabilistic definition is considered in section 
[4]). At time n > 1, the sampling step performs a selection- mutation operation and is equivalent 
to choosing, with replacement, individuals from the population on the basis of their fitness, 
proportional to g (•, followed by them each mutating in a conditionally independent man- 

ner according to / . 



Algorithm 1 



For n=0, 



Sample (£o) i=1 ~ A 4 



Report ^ = 1^^, Z» = l. 

i=i 

For n > 1, 

1 N 

Report Z* = Z^_ x —^g ^ n _ v y n -i), 



j'=i 



1 t*\N \ N iid ^= l9 [^n-VVn-l) f (Cn-D 
Sample (^). =1 (^_i) i=1 ~ 



1 N 

Report tt^ = jjJ2 d &- 



i=l 



A large number of variations and extensions of this algorithm have been developed. A full 
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survey is well beyond the scope of the present work, but a few comments are called for. Firstly, 
Algorithm [1] implicitly uses "multinomial resampling" at every time step. It would be interesting 
to investigate simil ar results to those presented here for o ther resampling schemes, for example 



via the analyses of IChopinl [2004], 



Del Moral et al. 



201 1| . Secondly, Algorithm [T] involves mu- 



tation at every time step according to the Markov kernel /. Again, various alternative schemes 
have been devised. Mutation according to / is not an essential characteristic of the main results 
of section U and it is only for simplicity that the results of section [3] are presented in this context. 
Thirdly, the results presented here are likely to be relevant to relate d classes of sequential M onte 
Carlo methods, f or example the smoothing algorithms treated by iDel Moral et al.l [20101 ] and 



Douc et al 



2011 1- 



2.3 Existing stability results for particle niters 

One o f the first and most influential works on stability of particle filters is that of 



Del Moral and Guionnet 



2001] who established time- uniform convergence properties of the particle approximations. They 



required uniform upper and lower bounds on g and stability of the corresponding exact filter, in 
turn derived using quite strong assumptions on / involving simultaneous, uniform minorization 
and majorization, whi ch are rarely satisfied then X i s non-compact. Similar mixing assumptions 



lave b een employed in iLeGland and Oudiand [20041 ] , IChopinl 20041 ] , 



Kiinsch 



20051 ] 



Cerou et al 



20111 ] in order to establish (respectively) uniform convergence of particle filtering approxima- 



tions; a time-uniform bound on the asymptotic variance; and linear-in-time bounds on the 
non-asymptotic variance of the normalizing constant. All also consider variants of the standard 
par ticle filter in Algorithm [H 




LeGland and Oudiand [20031 ] developed truncation ideas in order to achieve uniform particle 



approximations without mixing assumptions, but with random comp utational cost and/or pro- 
posals restricted to compact sets. A further development was made bv lOudjane and Rubenthaler 
2005l | . allowing treatment of some non-ergodic signals via a particle filter in c orpor ating an ac- 
cept/reject step. Truncation ideas have also been used in 



Heine and Crisan 



2008l | in order to 



obtain uniform convergence of particle filter approximations for HMM's on non-co mpact state 
space s with quite specific structure (including X and Y being of the same dimension). 



van Handel 



[20Q9l | has established u niform convergence of time-a veraged filters under tightness assumptions 
on non-compact spaces. IDel Moral and Jacodl |200l| proved tightness of the sequ ence of asymp- 



Favetto 



totic variances (as a function of random observations) in the linear-Gaussian case, 
has proved tightness of the same for a class of HMM's, but subject to a mixing assumption on 
/• 

It is stressed that: 1) a time-uniform bound on the asymptotic variance for tr^ and 2) a 
linear-in-time bound on the relative variance for , as pursued here, are different properties 
from the time-uniform convergence results proved in most of the ab ove. The exis ti ng works 
featu ri ng the most sim ilar type of results to t hose considered here are Chopinl . l2004l | , Kiinschl . 



2005| . [Favettol . 120091 ] and 



Cerou et al 



201 1| . all of which rely on strong mixing assumptions, 



at least on /, which we do not invoke. 

The overall approach used in the present work to express 
ciated functionals is the semigroup formulation of IDel Moral 



'eynman-Kac formulae and asso- 



2004], but the stability ideas are 
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different and are base d around a weighted oo-norm funct ion space setting. In Theorem [TJ the 
decomposit i on id ea of Kleptsvna and Veretennikov |2008| and some technical approaches from 



Douc et al 



20091 ] are employed. 



For completeness we also mention the following. IWhitelevI [201 ll | considered stability prop- 
erties of a related class of sequential Monte Carlo methods which are not used for filtering and 
operate in a different structural regime, where the number o f distributions involved may be 



considered a parameter of the algorithm. [Whitelev et al.l |201ll | considered relative variance for 
in the context of time-homogeneous Feynman-Kac models (obtained in the present setting 
by setting all yo,yi,... to a constant), appealing to spectral properties of the integral kernel 
involved. There is nothing explicitly spectral about the present work, but there are some re- 
lated structural ideas involved (see section [T]). For example, Theorem Q] is expressed in such 
a way that it may be viewed as an non-homogen eous analogue of the u-norm multiplicative 
ergodicity results of iKontoviannis and Mevnl [2005|, in the context of positive operators. The 
assumptions in the present work also allow the treatment of time-homog eneous Feynman-Kac 
models, and in that setting are actually stronger than the assumptions of I Whitelev et al.l [201 1 1 
(because in assumption (H3j)-(HHJ) of section H~2l here, we require a sim ultaneous local minori za- 
tion/majorization condition), but on the other hand the approach of I Whitelev et al.l 201 ll | is 
specific to the time-homogeneous setting. 



3 Summary and application of some results 

In this section, the results of section [T] are summarized and applied to some specific hidden 
Markov models and the particle filter of Algorithm [TJ To this end we consider the following 
assumptions on fi, f and g which serve as an intermediate layer of abstraction and which 
together imply that assumptions (HTJ)-(II[5]) of section [T] are satisfied. Discussion of the latter 
assumptions and their relation to the existing literature is given in section [4.1,11 
Consider the following: 

• Y* C Y is measurable, and the quantities in the below conditions may depend on Y*. 

• There exists V : X — > [1, oo) unbounded, d G [1, oo) and 5 > with the following properties. 
For each d G [d, oo), 

g(x, y) [ f(x, dx') > 0, Mx G X, y G Y*, (3.1) 
Jc d 

where := {x : V(x) < d}, and there exists < oo such that 

sup g(x,y) / f(x,dx')exp[V(x')] 
yeY* ix 

<e W [V(x)(l-5) + b d I Cd (x)}, VxGX, (3.2) 
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and there exists a probability measure v d and < e d < < oo such that 

qv d (dx') l Cd (x') 

< 9 (x, y) f [x, dx') l Cd {x') 

< e+u d {dx') l Cd {x') , Vx£C d ,y£ Y*, (3.3) 

with v d (C r ) > for all r G \d,d] 

• f exp [U(x)] fi (dx) < oo 

• Although not required for all results of section |U in the present section it is also assumed 
that 

sup g{x,y) < oo. (3.4) 

(z,y)eXxY* 

The condition of (|3.2|) is a multipl icative drift condit ion. Similar conditions have been used in the 



study of stability of exact filters [Douc et all 2009] and can hold when Y* = Y is non-compact. 
It may be the case that / alone satisfies such a multiplicative condition (see section [331 below) . 
in which case (j3.2[) can be satisfied when sup ye y g(x,y) is not bounded above in x. When (|3.4p 
holds, then fj3.2j) can hold even when / is not ergodic, but it is then typically required that 
Y* C Y is compact (see section l3~2]) . The conditions of (j3.3[) and (|3.4p together imply that for 
all d G \d, oo), 

g{x,y) ^ 

sup sup — -— — - < oo, 

yeY* ( x ,x')eC d xC d 9{x ,y) 

which can, loosely, be interpreted as a constraint on the amount of information which any single 
observation in Y* can provide about the hidden state in each C d . For the example of section 
13. 1.11 we are able to satisfy the assumptions when Y* = Y is compact. For non-compact Y in the 
examples below, we resort to taking Y* compact. 

Under the above assumptions, the main conclusions of Propositions [3] and |U section |4~5"1 may 
be summarized as follows. 



Uniformly bounded variance in the CLT for 7r 



It is known (e.g. [Del Moral! . 12004 . Section 9.4.2]) that under (fcT4"|) . for any tp : X -> R bounded, 



measurable, n > 1 and any yo :n 



N / [tt^ (dx) - 7r n (dx)] <p(x) — > M (0, al (y 0:n )) 



x 

in distribution as iV — > oo. Under the conditions of (|3.ip - (|3,4p . Proposition |3] may be applied to 
establish there exists < oo depending on Y*, such that for all such ip and n > 

a 2 n (y 0: n) < Var^ n (ip) + \\^\\ 2 Vy 0: n G Y^ +1 , (3.5) 
with ||-|| the sup norm. Discussion of a CLT for other classes of ip is given in section 14.5.11 
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Linearly bounded relative variance for Z„ 



N 



Under the conditions of (|3,ip - (|3,4p . Proposition @] may be applied to establish that there exists 
< oo depending on Y* such that for all n > 0, 



N>c'(n + 1) 



E, 



Zn. 



< c ;_(n+l), VfeGY 



n+l 



(3.6) 



where is expectation with respect to the law of the iV-particle filtering algorithm initialized 
using \x. 

3.1 A class of ergodic signal models 



The following class of signal model has been considered bv iKleptsvna and Veretennikovl [200 
and iDouc et al.l [2009l | in the context of stability of exact filters (i.e. without particle approxi- 



mation). We have X = M. dx for some d x >1. The transition kernel / corresponds to the signal 
model 



X n+1 = X n + B(X n ) + a (X n ) W n , (W n ; n > 1) ~ N (0, I d: 



(3.7) 



with 



B is a c^-dimensional vector function, locally bounded and 



lim sup \x + B{x)\ — \x\ = — oo 

r ->°°\x\>r 



(3. 



a is a d x x d x matrix function, and has the so-called non-degenerate noise variance property 
< inf inf X T a(x)a T (x)X < sup sup X T a(x)a T (x)X < oo. (3.9) 

xG Rd x AeR^,|A|=l X £ R d x AeR^,|A|=l 



As per Lemma U] in the appendix, / in this case itself satisfies a multiplicative drift condition 
with v(x) := exp (1 + c|x|) for c a positive constant. An example of a possible signal model 
with non-Gaussian transition probability and / itself sat isfying a multip l icatiy e drift condition 
is the discretely sampled Cox-Ingersoll-Ross process, see Whitelev et all l2011 |. 

We now discuss some observation models which may be combined with the signal model 
above. 



3.1.1 Discrete- valued observations 

With Y = {0, l} dx , consider the multivariate binary observation model 

Y^) {X n = x n } ~ Be [p (4)) ® • • • 8) Be (p ( 
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where Be denotes the Bernoulli distribution, p(x) : = l/(l + e x ) and Y n = (Y^, . . . , Y^ x } , 
x n = (x* , . . . , x„ ) . This corresponds to 

?(^)=n P (^ =i1 (i-pmi^ 1 . 

Clearly sup x y g(x, y) = 1 and for any compact C C M. dx , mf x£ c hifj/eY y) > 0. Combined 
with Lemma HI this establishes that the assumptions of equations (|3.ip . (j3.2|) and (|3.3p are 
satisfied when this observation model is combined with the signal model of equation (|3,7|) - (|3.9|) . 

3.1.2 Uninformative observations in R d 

With Y = R d w, dy > 1, Consider the observation model 

Y n = H(X n ) + ( n , (C n ;n>l)^Af(0,I dy ), 

with H a bounded, vector-function. That the disturbance terms are standard normal here is 
only for simplicity of presentation. Obviously we have 

1 ( 1 r mT 



y ) = ex P ^ ~ ^ ~ 

so that sup/^jAwx Y) 5'( x ' 2/) = (27r) — . In this case the observations may be considered 
uninformative as for each y, inf xe x 9( x ,y) > 0. In light of Lemma [4] standard calculations show 
that this observation model combined with / of (|3.7p - (|3.9p satisfies the drift condition of (13.211 
with Y* = Y and d chosen large enough. However, when we attempt to verify fj3.3j) (via ()5.2[) in 
Lemma Hf by incorporating g(x,y), the minorization part of (|3.3p is not satisfied with Y* = Y, 
due to the requirement of uniformity in y. We may satisfy (|3.3p by taking Y* C Y a compact 
set, and the constants involved will then depend on Y*. 

3.1.3 Stochastic volatility observations 

With Y = R and d x = 1, consider the stochastic volatility observation model (considered in 



Douc et al. 



20091 . Section 4.3]), 



Y n = p exp (X n /2) e n , (e n ; n > 0) ~ N (0, I dx ) 
where /3 > is a fixed parameter of the model. The corresponding likelihood is 

s( x > y) = — ~m~ ex p i~y 2 ex p /( 2 ^ 2 ) ~ x / 2 ] ' 



Douc et al 



2009, Section 



which is not uniformly upper-bounded on X x Y. But, as noted in 
4.3], sup^x g{x,y) < (2vre) _1/2 |y| _1 . For < y < y < oo, take Y* := [-y, -y\ U \y,y]. Then 
()3.4p is satisfied and using Lemma [U the drift condition of (j3.2j) and the upper bound of (|3.3p 
are satisfied with d large enough. The lower bound of (I3.3|) is also satisfied because for d < oo, 
mf {Xty)eCdXYir g(x,y) > 0. 
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3.2 A class of possibly non-ergodic signal models 

We now consider a class of signal model which includes some non-ergodic / and point out how 
characteristics of the observation model can be used to satisfy the drift condition (|3.2p . 
Take X = M. dx for some d x > 1 and consider the signal model 

X n+1 = B{X n ) + W n , (W n ;n>0)~N(0,I dx ), (3.10) 

with B is a <i x -dimensional vector function, locally bounded. That the disturbance terms (W n ) 
are standard Normal is only for simplicity of presentation; one can draw analogous conclusions 

under conditions such as (I3.9p . but we focus here on the interplay between V, Y* , B and g. For 

X T x 

some 5q > 1, take V(x) := — — r + 1. 

2 (1 + 6q) 

Assuming that Y*, B and g are such that, for some 5\ € (0, 1), 

lim sup sup + ^-B T (x)B(x) + log g{x,y) < 0. (3.11) 

r ^°° |aj|>r y£Y* 2 (1 + d ) 2d 

standard manipulations then establish that the drift condition of (|3.2p is satisfied with 6 < 6\ 
and d large enough. For the condition of (|3,2p . again with d large enough we can take Vd the 
normalized restriction of Lebesgue measure to if it is the case that 

inf g(x,y)>0. (3.12) 
(x,y)eC d xY* 

The conditions (|3.1ip and ()3.12p are satisfied, for example, when 

• The signal model is a random walk, B(x) := x 

• Y = R dy , 

Y n = H(x) + o y C n , (C„; n > 0) ~ M (0, I dy ) 



1 fir M T 



with (T y > 0, so that 

9(x, y) = ( ^ )V2 ^ exp [-^Jy- H{xt [y ~ H(x)} 

• Y* is compact 

• H is locally bounded and such that 

X T X + if A / \T tt / ^^ H(x) T H(x) 

lim sup - ^7—^ + — SU P y SU P A H W ~ H 2 < °- 

For the latter condition, we observe a trade-off in terms of 5q (which defines V), the constant 
5i (6 < Si appears in the drift condition), the observation noise variance and the growth of 
H(x). 
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4 £ w -stability of Feynman-Kac formulae and particle approxima- 
tions 

4.1 Definitions and assumptions 

As per the introduction, let the Polish state space X be non-compact and endowed with its Borel 
cr-algebra B(X) (the observation space Y will not feature explicitly in the following Feynman- 
Kac formulation, see Remark 2 below). For a weighting function v : X — >• [l,oo), and ip a 
measurable, real-valued function on X, define the norm \\ip\\ v '■= sup xG x\<p(x)\/v(x) and let 
C v := {ip : X — > R; \\<p\\ < oo} be the corresponding Banach space. Throughout, when dealing 
with weighting functions we employ an lower /upper-case convention for exponentiation and write 
interchangeably v = e v . 

For K a kernel on X x B(X), a function ip and a measure r\ denote r](<p) := J f(x)r](dx), 
Kip(x) = K {(p) (x) := J K(x,dy)ip(y) and rjK(-) := J rj(dx)K(x,-). Let V be the collection 
of probability measures on (X, B(X)), and for a given weighting function v : X — > [1, oo) let V v 
denote the subset of such measures r] such that rj(v) < oo. 

The induced operator norm of a linear operator K acting C v —> C v is 

|||iq|„ :=sup{^^;99E£„||99||„/o} = sup{||l^||„ ;y> G £„, M < v] . 
L IIVIIu J 

The corresponding u-norm on signed measures is \\rj\\ v := sup^ <v r](v). For any n > 1 and 
1 < s < (n + 1), define X njS := {(ix, ...,i s ) G Ng; < i\ < . . . < i s < n}. 

Let fi G V be an initial distribution and for each n G N let (M n ;n > 1) be a collection 
of Markov kernels, each kernel acting X x B(X) — > [0,1]. Let (G n ;n > 0) be a collection of 
^(X)-measurable, real- valued, strictly positive functions on X. 

Next let (Q n ; n > 1) be the collection of integral kernels defined by 

Q n (x,dx') := G n -i(x)M n (x,dx'). 

For 1 < p < n, let Q Ptn be the semigroup defined by 

Q P ,n ■= Qp+i ■■■Qn, P <n, (4.1) 

Qn,n = Id and by convention Q n +i jn = Id. 

We now introduce our first two assumptions, which will be called upon in the following. 

(HI) There exists V : X — > [l,oo) unbounded and constants 5 > and d > 1 with the fol- 
lowing properties. For each each d G [d, oo) there exists < oo such that the following 
multiplicative drift condition holds: 

supQ n (e y ) < e ni-5)+Mc dj (4.2) 

n>l 

where C d := {x G X; V{x) < d}. 
Whenever (HI]) holds we may also consider: 
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(H2) neV v , where v = e v is as in (HQ. 



We may now proceed with some further definitions. Define the collection of measures 
(ln',n > 0) and probability measures (r] n ;n > 0) 



ln (A):=fxQ 0>n (A), Vn (A):=^l, A e B(X), 

In (1) 



(4.3) 



where the dependence of (^ n ) and (r] n ) on the initial distribution fi is suppressed from the 
notation. 

Before going further we note the following elementary implications of the assumptions 
(HI]) and (H[2j) introduced so far. Assumption (HI]) implies that for all n > 1 and x £ X, 

Q n (e v ) (x)/e v ^ < e b ^ < oo and thus for all < p < n and x G X, 



Q P) n (e y ) (x) < oo, 



(4.4) 



Combined with Assumption (H21), we also observe that that for all n > 0, r/ n £ V v . 

It is straightforward to verify that the unnormalized measures (7„) have the following product 
representation: 



n-1 



j n (A) = Hrj p (G p ) Vn (A), n>\. 

p=0 



(4.5) 



We denote by the expectation w.r.t. to the canonical law of the non-homogeneous Markov 
chain (X n ;n > 0) where X$ ~ [i and X n \ {A n _i = ~ M n (x„,_i, •). For p < n and a 

suitable test function ip we abuse notation by writing 

E p , x [<p(X p ,...,X n )] := E tl [<p(X p ,...,X n )\X p = x], 

and for a probability measure r\ we write 

E PtV [cp(X p ,...,X n )] := f r l (dx)E PtX [ ( p{X p ,...,X n )}. 

Jx 

Under these notational conventions we have, for < p < n and r\ E V, the identity 

Tn-l 



VQp,n( A ) = E 



p. >l 



Y[G q (X q ) I [X n G A] 



q=p 



In particular, 



r]pQp,n(A) = E. 



PMv 



n-1 



Y[G q (X q )I[X n eA\ 



.q=p 



n-1 



Y[v q (G q ) Vn (A), 



q=p 



due to (|4.3p and (|4.5p . which will be used repeatedly. 
Definition 1. (X-values and h- functions) . For n > let 



\ n :=r] n (G n ) 
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and for < p < n let h niP : X — > (0, oo) be the function denned by 

Vn(x):=l, h p , n {x) := Q ^S ) 1 (a) , P<n. (4.6) 

Remark 1. It is stressed that each A p , and therefore each /i Pi n, depends implicitly on the initial 
distribution /i. This plays a key structural role in the proofs which follow. With the exception 
of Corollary [TJ throughout the following [i should be understood as arbitrary but fixed. 
The two other main assumptions are the following. 

(H3) With d as in (HTT]), for each d G \d, oo), 

Q n (x, C d ) > Vx G X, n > 1, 
and there exists eT > and £ V v , such that 



inf Q n (x,C d nA) > ejv d (C d nA), Vx G C d , A € B (X) . 

n>l 

with f d (C r ) > 0, for all r G [d, d]. 
When and (Hj3j) hold, we may also consider: 

(H4) With d as in (H[T]) and (^), (e^") as in (H[3]), for each d G [d, oo) there exists ej G [e^, oo) 
such that 

sup Q n (x,C d DA) < e%v d {C d nA), Vx G C d , A G B (X) . 

n>l 

4.1.1 Comments on the assumptions 

Assumption s (H[3])-(H3]) taken together are more specific than the local-Doeblin condition of 



Douc et al 



2009] (when the latter is considered as holding for non-negative kernels) because 



they are phrased in terms of the level sets for V and hold time-simultaneously. It is possible to 
obtain results which are the analogue of those presented herein under multi-step versions of (EBJ)- 
(HU), but this involves substantial notational complications which would obscure presentation. 

Assumption (HI]) is a type of multiplicative drift condition involving the Markov kernels 
(M n ) and the potential functions (G n ). A notable characteristic of this assumption is that it 



implies that for all e > there exists d > d such that 
itself time-simultaneous variation of 



Douc et al. 



Q n — Ic d Qn\\\v < £ f° r all n > 1, which is 



2009, condition H2]. In the present work, the 



sublevel sets of V play a central role in the proof of Theorem [2] and thus Proposition [JJ 

In the above definitions the functions (G n ) have been taken as strictly positive. It would be 
interesting to also consider vanishing potential functions, but that situation is more complicated 
as the particle system may become extinct. 
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4.1.2 Particle system 



The particle system may be considered a canonical non-homogeneous Markov chain and therefore 
its definition is only sketched. For N > 1, and each n > let £ n = , . . . , , be a X^-valued 
and then define 



7o" 



7 ^ := 

In ■ 



1 N 



n > 0, 



i=l 



n-1 

n < 

p=0 



^, n > 1. 



The particle system of population size TV is the X^- valued Markov chain with transitions given 
symbolically by 



j. iid 



C -lQn (•) 
1 (Gn-l) 



N 
in- 



n > 1. 



Remark 2. In order to obtain algorithm [T] take G n {x) := g(x,y n ), M n (x,dx') := f(x,dx'). 
In that case 77^ = 7r^ and 7„ (1) = and similarly, 7/ „ = 7r n , 7 n (l) = Z„. Ot her particle 
filters (such as the "fully-adapted" auxiliary particle filter of lPitt and Shephardl 1999J) arise from 
other choices of G n and M n . More generally, the state-space X may be augmented, e.g. to X 2 , in 
order to accommodate M n corresp onding to other cho ices of proposal kernel and corresponding 
importance weight, see for example 
versions of (H3])-(Ifi|. 



Doucetetal 



2000] . In such cases one would need multi-step 



4.2 Uniform ^-controls 

The main results of this section are Propositions [1] and [2j which establish uniform controls on 
the measures (rj n ), the A- values and the /i-functions. The uniform bounds of these propositions 
play a key role in the stability results which then follow. 

The first key ingredient is the following Lemma, which establishes some relationships between 
the measures (%), the A- values and the /i-functions. 

Lemma 1. Assume (HJ^-(IJ^). The measures (r] n ), h-functions and X-values satisfy, for any 
n > 1 and < p < n, the recursive formulae 



'HpQp+i — ^pVp+ii Qp+i (hp+i,n) — ^php,m 



(4.7) 



and 

Vp ( h P,n) = 1- 

Furthermore h pn G C v where v = e v is as in (lUty- 
Proof For the measure equation, 

ln{A) 7„_iQ n (A) rj n -iQ n (A) 



rin(A) 



Vn-lQn (A) 

7n(l) 7n-l<3n(l) Vn-lQn (1) Vn-1 (G n -l) ' 
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where the third equality is due to the product formula (|4.5|) . For the h- function equation, using 
Definition [H 

Q p -l >n {\) _ 1 Q p Q p ,n{l) _ 1 



h 



p—l,n 



Vi n 



n— 1 \ 



A 



p-1 



.n) 



lq=p-l /K Q 

The equality r] p (h p ^ n ) = 1 is direct from (|4.3p and the definition of h Ptn . The assertion h Ptn £ C v 
follows immediately from Definition [1] and (|4.4j) . □ 

The second key ingredient is the collection of kernels and drift functions identified in the 
following definition (that these kernels are Markov is a consequence of Lemma [1]) . 

Definition 2. (S-kernels and drift functions). For n > 1, 1 < p < n let S Pjn :XxB (X) — > R + 
be the Markov kernel defined by 



S P: n (x, A) : 



_ Qp {^Ahp,n) (x) 



^p-lhp-l,n{x) 



(4.8) 



and let v PyTl : X — > [l.oo) be defined by 



Vp t n (x) '•' 



v(x) 



\h 



p,n\\ v , 



where v is as in (HI]). 

" (n) 

For each n > 1 and rj E V, We denote by E^ expectation w.r.t. the canonical law of the 
(n + l)-step non-homogeneous Markov chain {X P:n ; < p < n} with Xo jn ~ 7] and for 1 < p < 
n,X p ^ n \ {X p _i in = Xp-i^n] ~ (xp-i in , •). By analogy to the definitions of section [4~Tl for 
each n > 1 we write 

Eg (X p>n , X n , n )] := Ef) (Xp, n , . . . , X n , n ) I X p>n = x] . 

The S'-kernels and the corresponding expectations are of interest due to the following change- 
of-measure identity. 

Lemma 2. Assume (HJ^-(I^). For any n > 1, < p < n, a suitable test function ip and x £ X, 

: h p>n ( X )E$ [<p(X ptn ,...,X n>n )] . 



Ep 5 a; 


'TV£G q (X q )<p(X p ,. 




F 


Hq=p G q (Xq) 





Proof. From Definitions [T] and [2j 

Ep,^ 



Y\ n Q =l G q ( X q) ¥ ( X P> ■ ■ ■ > X V 



VpQp,n (1) 



hp,n(.x)^ P: . 



n-1 



n G q (X q ) hq + i n (X q+1 ) 
X 1 7v~~\ — V V A 0, • • • , -X-n) 



.q=p 



hq,n \ X q 



hn,n ( X n) 



h pjn (x)t p n) x [if [Xp in , X n>n )] 



□ 
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Remark 3. The S- kernels have previously been identified as playing a key role when analyzing 



stabi lity properties of Feynman-Kac formulae and particle systems, see Del Moral and Guionnetl . 



20011 ] . albeit written in a slightly different form. From Definition Q] we have immediately that 

Q p (lAh Pt n) (x) _ Qp (IaQpA 1 )) 0*0 



5p,n {%■> A) 



l,n 0*0 Qp-l,n{ l )( X ) 



and it is in the latter form that these kernel are usually considered. However, in the context 
of the Lyapunov drift techniques employed here, (|4.8p expressed in terms of the A-values and 
/i-functions plays a central role in proofs of the two following propositions. The main theme 
of the proof of Proposition Q] is to obtain uniform bounds on via the representation of 

Lemma [21 the identity Ep™2 [v (X njn )] = S p +\^ n ■ ■ ■ S n , n (v) (x) and the drift functions (v Ptn ). 
Note that Proposition Q] does not require the majorization-type assumption (H|4]). 

Proposition 1. Assume (HJ^-(I^) and let v be as therein. Then there exists a finite constant 
c M depending on fx and the quantities in (Hjty and (H^, such that 

sup ||r7nlU < CnfJ-(v). 

n>0 

Proof. See section [SJ □ 

The interest in the uniform bound of Proposition [1] is that, via the following proposition, we 
obtain some uniform bounds on the A-values and /i-functions. 

Proposition 2. Assume (HJ^-(I^) and let v be as therein. Then l)-2) below are equivalent. 
!) sup n > ||?7nlL < oo 

2) inf n > X n > 

If additionally (H|4j) holds then 1) and 2) are equivalent to 3): 

3) sup n >! sup < p < n \\h Ptn \\ v < oo 



Proof. Lemmata [7] [5] and See section 



□ 



Before proceeding further, note that in the results from this point on, the statements often 
feature a constant c M . The value of this constant may change from one result to the next. 



4.3 A multiplicative stability theorem 

The form of the following re sult can be interpreted as a non -homogeneous analogue of the multi- 
plicative ergodic theorem of iKontoviannis and Mevnl 20051 in th e conte xt of positive operators, 

Theorem 2.2, equation 



Whitelev et al. 



2011 



for direct comparison the reader is referred to 
(2.10)]. This proposition will be applied in section [4~5l to bound the asymptotic variance asso- 
ciated with (77^) ■ The proof is postponed. 
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Theorem 1. Assume (TTIty-(I^). Then there exists p < 1 depending only on p and the constants 
in an d an d c fi < 00 depending on the quantities in (Iflty-(I^) such that for any 

<p G C v , n > 1 and < p < n, 



Q P ,n (x) 



Hq=p \ 



hp,n(x)r] n (if) 



< P n P \\<P\\ V Cfj.v(x)fi (v) , Vx G X. 



Proof. See section [5j 



□ 



As a consequence of this theorem we obtain u-norm exponential stability with respect to 
initial condition for measures (rj n ). 



Corollary 1. Assume then with p and p as in Theorem{Jl for any p! G V v , there 

< P n c^'P (v) p! (v) , 



exists c^^i < 00 such that 



'In 'In 



, (p) PQo,n , (//) P Qo,n 

where rpj' := — — - and rjn ■= —tf> 7TT- 

»,«(!) |U'Qo,n (1) 

Proof. Taking the bound of Theorem [1] and integrating w.r.t. p! gives 



P'Qo,n(f) i lh \ (a) f \ 

: P (^0,n) O) 

1 lp=0 A P 



< 



P \m\v c ^P\ v )P W ■ 



It is stressed that in the above display X p and /io,n are as in definition [TJ i.e. dependent on p, but 
not on p! . Now as p! G V v , for any d G [d, 00), // (CJ) < // (ic^e^) /e < p (e ) /e d so there ex- 



ists d G [d, 00) such that //' (C^) > 0. Then dividing through by p! (/io,n) = P-'Qo,n(^)/ llp=o 



p'Qo.niD ^ m 



P' Oo,n) 



p{v)p (v) 



v p' {C d )mi xeCd h , n (x) 

< p n IMI„ c v,ii' p ( v ) p' ( v ) > 



M 0) 0) 



where the final inequality holds due to Lemma [TUl 



□ 



4.4 Exponential moments for additive functionals 

We now present a result on finite exponential moments for a class of additive, possibly unbounded 
path space functionals. It will be applied in section 14.51 to bounds on the relative variance 
associated with 7^ (1). The proof is mostly technical and is given in the appendix. 

Theorem 2. Assume (I{1^-(I^ and let 5 andv be as therein. Then there exists a finite constant 
c^ depending on p and the quantities in (I{J^-(I^ such that for any collection of measurable 
functions {F n ;n > 1} with each F n : X — > R and sup x (\F n (x)\ — 5V{x)) < 00; any n > 1, 
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< s < n + 1, and (ii, . . . ,i s ) £ I n ,s, 





rip=o g p ( x p) 


exp 


(Sfce{ii,... 


, ls} \Fk(X k )\) 




n P ": 


=o G P {X p ) 





fee{ix,...,i s } 



mi 



twii/i i/ie conventions that, when s = 0, i/ie summation on the left hand side is zero and the 
product on the right hand side is unity. 



Proof. See section [SJ 



□ 



4.5 Variance bounds 

Remark 4. At this point we introduce a further assumption, (IEJ) below. This assumption is 
not necessary for all of the results of this section but is employed for the following three reasons: 
1) it is not restrictive in filtering applications; 2) it allows Lemma [3] below to be invoked (an 
equivalent result can also be obtained without (HE]), but subject to constraints on the growth 
rates of (G n ) and the assumption that the Markov kernels (M n ) themselves obey a suitable 
simultaneous multiplicative drift condition); and 3) it allows an existing CLT for particle systems 
to be simply stated below without proof (see also remark 6). 

(H5) sup n > sup a;gX G ri (2;) < oo 

The following lemma plays an important technical role in the variance results which follow. 

Lemma 3. Assume (HJ^-(I^) with v the drift function in fi{7])-(7Q). Then for any a G (0, 1), 
the statements of (I{Ity-(I^4ty also hold for the drift function v\ := v a and with a-dependent 
constants. 

Proof. Let G := sup n>0 sup xeX G n (x). Then for all x £ X, and any d £ [d, oo) as in (HI]), 
sup Q n (e aV ) (x) < G sup 

n>l ' n>l 

= G 1 -"sup[Q n (e y )(x)] a 

n>l 

< exp [aV(x)(l -5) + ab d I Ci {x) + (1 - a)logG] , 

where Jensen's inequality and (HdJ) have been applied, and 5, and = {x £ X; V(x) < d} 
are as in (H[T]). Then for any 5q £ (0,5) and G < oo there exists £ [d, oo) such that for any 
d £ \d a ,oo) and x ^ {x £ X; aV(x) < d}, 

sup Q n (e aV ) (x) < exp \aV(x) (1 - 5 ) -ad (6- S ) + (1 - a) log G] 

n>l 

< exp[aV(x)(l-6o)], (4.9) 



G n ^l{x) 

G 



M„ 



(x) 
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and for x E {x £ X; aV(a;) < d}, 

sup Q n (e aV ) (x) < exp \ad(l — 5) + a& d + (1 — a) log G] 



n>l 



=: exp (b d a ) . 



(4.10) 



The statement of (HI]) holds with the drift function v\ := v a because equations (|4.9p - (|4.10|) show 
that we may replace d, 5, bj, G d in the corresponding statements with d Q , 5q, bd t a,{x G X; aV(x) < 
respectively. 

It is immediate that (H|2j) holds for u Q , because v > 1. (H[3])-(t(3|) a l so hold for by 
replacing d, G d ,e^~, ej~, i/ d with d a , {x G X;aV(x) < d}, e d/a , e+, , u d/a , respectively. □ 



4.5.1 Asymptotic variance for 77, 



N 



Remark 5. There are several existing CLT resul ts for the particle systems in question, see for 



example [Chopin 



2004 



Douc and Moulines 



2008] . We choose to present that of 



Del Moral 



2004, 



Proposition 9.4.2], as it holds immediately under (H]5]), and we may state also the corresponding 
asymptotic variance expression with essentially no further work. The restriction is that the 
stated result holds only for bounded functions. It is of interest to investigate whether the same 
result holds for a suitable class of possibly unbounded functio ns in terms of v, for example via 



Douc and Moulines 



2008], but this is beyond the scope of 



the techniques of IChopinl [20041 ] or 
the present article. 

The following CLT holds for errors associated with the particle approxi mation measures 
) • Straightforward manipulations of the asymptotic variance expression of Del Morall . l2004l . 

Proposition 9.4.2] show that it can be written as in (|4.1ip below, in terms of the /i-functions 

and A- values. 



'Del Mora 



Theorem 3. 



and measurable and any n > 1, 



2004 - Proposition 9.4-2]- Assume Then for <p : X 



bounded 



N( V % - Vn )(<p) -> Af{0,a 2 n ) 



in distribution as N — >■ oo, where 



°l : = Vn 



n-l 

(if - f] n (v?)) 2 ] + ^V P 



p=0 



Qp,n (<f) 



q=p "1 



(4.11) 



We can readily apply the result of Theorem [T] to obtain a time-uniform bound on the asymp- 
totic variance. 

Proposition 3. Assume (HJ^-(I^). Then there exists < oo depending only on \x and the 
quantities in (I{J^-(I^ such that for any n>l, 

orl<Vn ((p-Vn{f)) 2 +c fi y\\ 2 1 ^(v) 2 , 



where v is as in f-fQ]) and <p and a\ are as in Theorem^ 
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Proof. As (HI])-(EJ3D are assumed to hold with some drift function v , then by LemmaJHl the same 
assumptions hold with the drift function v 1 ! 2 and suitable constants. Then applying Theorem [1] 
(using the drift v 1 ! 2 and the corresponding instances (E(T])-(H|3|), and then Proposition Q] (using 
the drift v), we find that there is c M < oo such that 



Qp,n (</?) 
Uq=p X q 



< ^cr\\ v \\l^*)\(v) 

< p 2 ^\\v\\lw{v)\ 



and the statement of the Theorem follows by summing. 



□ 



4.5.2 Non-asymptotic variance for 7^ (1) 

For n > 1 and 1 < s < n + 1 , define 



n 



with the convention that i 



[in wr 



Building from 



«+i = " 



Del Moral et al 



2009 1 



Cerou et al. 



201 ll ] obtained a non-asymptotic func- 



tional expansion of the relative variance associated with 7^(1). Elementary manipulations of 
this relative variance show that it may be written in terms of the quantities f f^ 1 ' as fol- 
lows, and as we assume (H[5]), the quantities involved are well defined (although this is not a 
necessary condition, one may alternatively assume (HI])-(t(2]) ). 



Theorem 4. 



Cerou et al 



2011 



E, 



7n(l) 



Proposition 3.4]Assume (tfB$. Then for any n > 1, 
21 



1 



ra+l 



£ 1 



2 \ ^ 

N J iV* 



E 

(n,...,is)6Xn,s 



where the expectation is with respect to the law of the TV-particle system initialized from jj,. 

We may now apply Theorem [2] in order to obtain the following linear-in-n bound on the 
relative variance. 

Proposition 4. Assume (HJ^-(I^) and let v be as therein. Then there exists a finite constant 
Cu depending on \i and the quantities in (I{J^-(I{^) such that for any n > 1, 



N > C(1 (n + 1) 



E,, 



7?(1) 
7.(1) 



< c M — (n + l)n(v) . 



Proof. Throughout the proof c is a finite constant depending on \i and the quantities in (H[T])- 
(H2D whose value may change on each appearance. 
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First notice that by definition [T] and the product formula (|4.5|) we may write 



T (h,...i a ) = MgojiW 1 



MQo,ii(l) 7n (1) 

n-l 



E, 



n-l 



n °p ™ n 



P=0 j=l \ llfc=ij A fe 



G p (X p ) ^ J -,i J+1 (JQ. 

p=0 j=l 



(4.12) 



with the convention that Y\™ = 1 in the first equality to deal with the case i s = n. 

Let v and 5 be as in (H[T]). Then by Lemma [3j the statements of (E(T])-(Ifi]) a l so hold for the 
drift function v s and with constants which depend on 5. Then Proposition Q] and Proposition 
[2] both applied with the drift function v s and the corresponding instances of (Ljl])-(H|4j) of show 
that 

SUp SUp ||/ip, n ||^ < OO, 
n>l 0<p<n 



so that, using the representation (|4,12p . and applying Theorem [2] with the drift function v and 
the corresponding instances of (HHJ-(HU), there exists a finite constant c such that 



n-l 



UG p (X p )Hv 5 [Xi 

p=0 3=1 



< c s n{v). 



Therefore by Theorem [U 



E„ 



7n(l) 



n+1 , 1 



(n+l)-a 



N s 



E 



(il,...,is)€Zn,a 



The remainder of the proof then follows by the same arguments as Cerou et al 
of Theorem 5.1 and Corollary 5.2], so the details are omitted. 



2011 



Proofs 
□ 



5 Proofs and auxiliary results 
Auxiliary result for section 13.11 

Lemma 4. When f is the transition kernel corresponding to the model of equations \3. 
there exists d<oo and 5 > such that, for any d £ [d, oo) there exists b d < oo and 



f (x, dx') v(x') < v(x) 1 - 6 exp [b d I Cd (x)} , xeX, (5.1) 

X 



where v(x) := exp (1 + c\x\) for c a positive constant, and furthermore for each such d there 
exists < < < oo such that 

e d v d (AC\C d ) < f(x,AnC d ) < e+v d (AnC d ), x e C d , A e B (X) , (5.2) 
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with Vd the normalized restriction of Lebesgue measure to Cd- Furthermore f c f(x,dx') > 0, 
Vx G X. 



Proof. As per 
that 



Douc et al 



2009], under the assumptions on the model, there exists j3 < oo such 



fx f (x, dx') v(x') 
v(x) 



< Pexp[c(\x + B(x)\-\x\)] 

\x + B(x) 



(5 exp 



-ex 1 



and then using (j3.8|) . there exists 8\ > such that for |x| sufficiently large, 

\x + B(x) 



1 



so for such |x| and S G (0,<5i), 

f x f (x,dx')v(x') 



v(x) 



< exp [-V(x)5 -c\x\(8i-S) + log /3 + 1] 



and by increasing |x| further if necessary, we conclude that the result holds with bd := d + log/3. 
and J c f(x,dx') > hold immediately. □ 



Proofs and results for section 14.21 

The proof of Proposition Q] is given after Lemma [5] and Lemma [6] 

Lemma 5. Assume Then for any d G [d, oo), any n > 1 and 1 < p < n, the 

following inequalities hold, 



Sp : n \Vp,n) — Pp,nVp—l,n Bp n l.(j d 



where 



Pp,n '■- 
Bp^n 



p~ Sd \\h II 

1 l l hp 1,71 | | ,y 

e d(l-8)+b d 



< OO 



hp n 



1 



P,n\\ v 



< oo, 



c d Vd (Jc d hp,n) 

and with the dependence of p p ^ n and B p ^ n on d suppressed from the notation. 
Proof. For x ^ Cd, 

Q p (v) (x) 



(5.3) 

(5.4) 
(5.5) 



\-\hp-l y n{x) 



I hp n 



< 



V p-J.'"p- 
v(x) 



Ap-l/lp-l,n(x) 



e -M ll/i II 



u Pin _l(x) 



\p—i || hp—\ n ||^ 
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where (HI]) has been applied. 

For iGQ, from Lemma [I] and (E(3j), 



\ p -ih p -i tn (x) = Q p (h Pjn ) (x) > e d u d (Ic d h p , n ) 



and thus using (HHJ, 



<? (v )(r) < r d(l-S)+b d \\ h P,n\\ v 1 



a d(l-8)+b d 



< 



Vd(^C d hp, n )' 



We have p P;n < oo and B p>n < oo because for any p < n, A p _i > 0, /i Pi „ € C v , h p . n {x) > for 
all x £ X, and for any d > d, Ud (Cd) > 0. □ 

Lemma 6. Assume (HJty -(H^- Then for any d £ [d, oo), < p < q < n, and x G X , 



E$ [ %n [X q>n )] 

e -Sd(q-p) " ' 



< 



+ 



nriA, ii^ni 



d(l-8)+b d 



IIV 



1 



9^ -<5%-fc) 

+ E 



i 



TOi/i i/ie convention that the sum is zero when p = q — 1 . 



(5.6) 



Proof. For each n, p and g in the specified ranges, the proof begins by recursive application of 
the drift inequalities of Lemma [5j A simple induction yields 



Eg [v q , n (X q , n )] 

( q \ 9 ( 9 \ 

\k=p+l J k=p+l \j=k+l J 

with the convention that the right-most product is equal to 1 when p = q — 1. 
By the definitions of (h Ptn ), (p p , n ) and (Bp ;n ), 



(5.7) 



k=p+l 



n 



-Sd 



\\hk,n\\ v 



k=p+l 

e -5d(q-p) 



Afc-1 ||^fc-l,n||„ 
hq,n \\ v 



rifc=p+l^-l IIVn||„' 



(5.8) 
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and for k < q, 



H Pj,n I Bk,n 
y j=k+l J 

e -5d{q-k) ||^ 



g,n\\ v 



,d(l-S)+b d 



\Hj=k+l A i-1 ll^fc.nl 
d{l-5)+b d 



\\hk,n\\ v 



1 



Vd{lc d hk,n) 



-Sd(q-k) j 



(5.9) 



The proof is complete upon combining (|5.7[) . (|5.8[) . (|5.9|) and applying the definition of B qtTl for 
the case q = k. □ 

Proof. (Proposition^) . For n = we have trivially r/o (f ) = n{v). 
For n > 1, by Lemma [2j 



% («) 









E[g=0 G g P^s) 





< I fi (dx) ho,n(x)&<?> [v n , n (X n , n )] 



(5.10) 



where the inequality is due to h n>n = 1 and ||/in,n||„ < 1- The proof proceeds by bounding the 
expectation. 

Fix d 6 [rf, oo) arbitrarily. Applying Lemma [6] with q = n and p = 0, and again noting 
K,n = 1, ||^i,n|L < 1, we obtain 



4") [ Wn , n < 



-8dn 



1 



iusa* ii v 



-^0,n (Z) 



+ - 



d(l-5)+b d 



-Sdn 



n ~ l e -Sd(n-k) 



Vd {C d 



A t Qo,n(l) ||/lo,n||„ 



3 d(l-<5)+6 d 



vo,n (a?) 



n-1 



-<5d(n— fc) 



(5.11) 



with the convention (as per Lemma [S]), that the summation is equal to zero when n = 1. The 
equality is due to the definitions of the A-values and /i-functions. 

We now obtain lower bounds in order to treat the fj,Q Q <n (l) an d Vd \^C d Qk,n (1)] terms. Recall 
that d G [d, oo) was arbitrary. Now choose arbitrarily r € [d, d]. Then under (E(3j), for any rj £ V v 
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and any < k < n, 

V[lc d Qk,n(l) 



E 



k.i] 



n-l 



icA x k)U G i ^ 



> E 



q=k 
n-l 



k,n *c r (X k ) J] G q (X q ) l Cr (X q ) l Cr (X n 

q=k 

> 77 (C r ) [e-v r {C r )] n ~ k . 



(5.12) 



Under (E(2|), for r and d increased if necessary, but still subject to r < d, we have /j,(C r ) = 
1 — \i (Cf) > 1 — fj, floe ) e _r > 1 — [i (e v ) e~ r > 0. Now hold r constant and if necessary, 
increase d so that e~ Sd < [e~v r {C r )\ 1 . Equation (|5.12p then gives 



-Sdn 



-Sdn 



sup ■ 



< 



Then under (HI]), noting 1^ (C r ) > and applying (|5,12|) . 

p —Sd(n—k) 



< 



< CO. 



(5.13) 



sup 

n>l 


1 


+£ 

k=l 


v d {C d ) 


< 


1 


1 



sup 



^<f (Cd) Vd (C r ) n >i 



'n— 1 

E 

.k=l 



g—8d(n—k) 

I (n-k) 



[e r V r (C r )] 



< CO. 



(5.14) 



Combining ([5.13p . (|5.14p and (|5.1ip . establishes that there exists a finite constant c n , indepen- 
dent of n such 



fiW [v n>n (X ntn )} < 



1 1 

\1 (C r ) ||/lo,ra 

and then returning to (|5.10p . we have shown that 

1 1 



-v 0) n (x) + c M . 



\l (C r ) [|fto,n[|„ 
+c M y h , n (x)iJ,(dx) 

H (C r 



h ,n(x)v ,n (x) fi (dx) 



where the final equality uses the definition of vq n an d the property [i (ho n ) = r\ (h,Q n ) = 1 as in 
Lemma [TJ Thus there exists a finite constant such that 

suprfo («) < c' n(v) , 

n>l 



which completes the proof. 

Lemma 7. Assume (I%T\)-(lX3\) and let v be as therein. Then 



□ 



24 



SU P \\Vn\\ v < 00 < ^=^ inf A n > 0. (5.15) 

n>0 n >° 



Proof. (=>). Suppose sup n>0 ||%||„ < 00. Then there exists a finite constant rj such that for any 
d> d, 

supr/„ (Cg) < sup — — ^ < sup — ^ < r,e d . 

n>0 n>o e a n>0 e a 



Thus for all /3 < 1, there exists d > d such that sup n>0 ?7n (C^) < /3. Thus for /3 E (0, 1) there 
exists r > d such that 

inf A n > inf rj n (lc r Qn+i (Ic r )) > e^r (C r ) inf r/„ (C r ) > e~i/ r (C r ) (1 - /3) , 

n>0 n>0 n>0 

where the second inequality is due to (H2J). 

(<^=). Suppose inf n >o A n > 0. Then there exists A > such that for any n > 1, 

, v . r}n-\Qn (e v ) ^ r, n ^iQ n (e v ) 
Vn\ e ) = 77^ r < x -1 

where f|4.T|) has been used. Now set d > d V ^— — log A^ . Then under (i^, 

( s Vn-i [icgQn (e y )] r? n _! [l Cd Q n (e^)] 
Vn[e ) S A + A 

e -6d e d(l-8)+b d 
< —r h -i(e V )+ j 

=: pri n -i (e V ) + B, (5.16) 

for some p < 1 and S < 00. Iteration of (|5.16p establishes 1). □ 
Lemma 8. Assume (I{lty-(I^4ty and let v be as therein. Then 

inf A n > => sup sup ||/ip,n|L < 00 • 
n >° n>l 0<p<n. 



Proof. Recall the definition 



For the case p = n, h p>n = 1. For other cases we proceed by decomposing and then bounding 
the numerator. 

Set d S [d, 00) arbitrarily, let n > 1, < p < n and define Tp := inf > p; X g G C^, X q+ \ € C^}. 
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Now consider the decomposition: 
Q P) n(l)(x) 



n-l 
k=p 
+^p,x 



n-l 



J] G q (X q ) I {r« = k} 

q=p 
1 

nG,(x,)i{f >«} 



,<?=p 



(5.18) 



and define 



A> := IHQpIcjHk, B p := \\\Q p Ic d \\\v, 
p+j-l 



i := v (X p ) , 



-3 •" 



n 

<?=p a„ ^ 5 d 



q+l D q+1 



Assumption (HI) implies that, for 1 < j < n — p, IE p +j-i,x p+J _i [£j] < ^j-i; so that 

E p , x [S n _ p ] < E PiX [H ] = v(x). (5.19) 
For k > p, define := Ylq=p^-C^ {X q )- Then the following bound holds under (HQ]): 



11^9+1 ^9+1 



.<7=P 



MM. 



g>(fc-p)/2} 



(p,fe) 



I {m$ > {k -p) /2| |lV sup IHQJI, 
< exp [- W (fe - p) /2] exp [6^ (fc - p) /2] . 



(k-p)/2 



< SUP [||Q 3 IC7=||U 
\9>1 



(5.20) 



Douc et al 
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where |||„ < has been used. 

Consider one term from the summation in (|5,18p with p < k < n. By 
Lemma 17], I {rf > k] = I l Cd (X q ) I Cd (X g+1 ) = j < I {m$ > (k - p) /2}. Then 

combining (|5.19p and (|5.20p and using (HHJ, 



E 



'p,.r 



'n-l 



L<?=P 



<e> d [I Cd Q fc+ i,„(l)]E p , 



fc-i 

[] G 9 (X,) I {mJJ > (k — p) /2} t> (X fc ) 



.9=P 



< e|^d [Ic d Qfe+l,n(l)] 

• exp [Sd (k - p) /2] exp [bj (k - p) /2] , fc > p, 



(5.21) 
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similarly 



E 



p,x 



n-1 



tT > n 



1=P 
'n-1 



<^p,x 



J] G q (X q ) I {M$ > (k—p) /2} « (X ri 

<3=P 

< exp [— <5cf (n — p) /2] exp [bd (n — p) /2] , 



(5.22) 



and also by (HU), 



E 



n-1 



tq=p 



< 



4 U d[^C d Qp+l,n( 1 )] V ( X )^ 



(5.23) 



recalling from section 14.11 the convention Q n +i jn = Id. Returning to (|5. 18[) . the bounds of 
(f5T2T]) - (f5T23|) show that for p < n, 



Q p , n (l)(x) 



n-1 



< 4< X ) Yl eX P I' 6 * 1 ( k ~ P) eX P [ & i ( k ~ P) / 2 ] ^ \fciQk+l,n(X)] 
k=p 

+v(x) exp [— 5d (n — p) /2] exp \bd (n — p) /2] . 



(5.24) 



We now turn to the denominator of (|5.17f> and stress that we are continuing to use the same 
arbitrary value of d as above. 

As per the statement of the Lemma, suppose A := inf„>o A n > 0. Then by Lemma [71 
f) := sup n > ??n ( eV ) < 00 an( i choosing independently e 6 (0, 1), by (HI]) d may then be chosen 
large enough that 

inf *M (C d ) = inf 1 - Vn (C c d ) > inf 1 - Vn (l c °e v ) e~ d > 1 - 7?e" d > 1 - e =: V . 

n>0 n>0 n>0 a — 



Then for p < k < n, 



'k-1 



\q=p 



> \ {k - p) Vk[ic d QkA l )\ 



> e. 



■ X {k - p) Vk (Cd) v d [Ic d Q fe+ i in (l)] 



> eTA^Vd [I Cd Q fe+1 , n (l)] 



(5.25) 



and for p < n, 



n P Qp,n{ 1 ) > e d Vp {Cd) v d [icaQp+iA 1 )} > e dR u d \i Cd Q P +i,n{ l )\ , 



(5.26) 



and also 



VpQpM > A (n_p) - 



(5.27) 
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Combining ([5.25|) - ([5.27|) with (|5,24|) and (|5.17p we finally obtain, for p < n, 

_j_ n— 1 

hp, n (x) < ^v(x)Y j exp[-5d(k-p)/2}exp[(k-p)(b d /2-logX)] 

+ v(x) exp [— 5d (n — p) /2] exp \{n — p) (bd/2 — log A)] . 

Then increasing d further if necessary, we conclude that there exists c < oo such that for any 
su Pn>i su Po<p<n h Pt n(x) < cv(x) and this completes the proof. □ 

Lemma 9. Assume and let v be as therein. Then 

sup sup ||/i p ,n||j, < oo ==> inf X n > 0. 

n>l 0<p<n ' n >° 
Proof. Suppose sup n>1 sup 0<p<ri ||/ip,n||„ < oo. Then by Lemma [1] and (H[3]), for any x G Cj, 

■ t \ ■ c Qn+l (hn+i n+l) (x) 

mf X n = mi — 

n>0 n>0 h n>n+ l(x) 

Qn+l (C d ) (x) 



> inf 



n>0 H/i^n+ll^ v(x) 

e d sup n > ||/l n ,n.+l 1 1 j, 
> 0. 



> 



□ 



Proofs for section 14.31 

The following Lemma will be used in the proofs of Theorems Q] and [2J 

Lemma 10. Assume and let d be as therein. Then for any d £ [d, oo), 

inf inf inf h v n (x) > 0. 

n>10<p<nx£C d y ' 

Proof. We will prove a finite, uniform upper bound on 

1 VpQp,n (1) 

x&c d n p ,n{x) x€ c d QpA l A x ) 

T]pQp,n (1) 



mf xe c d QpA l )( x ) 



(5.28) 



The proof uses the same approach as in the proof of Lemma [9] and therefore some steps are 
omitted for brevity. For the case p = n, r] n Q n ^ n (l) = 1 and Q n ,n(X){ x ) = 1 f° r an x. For the 
remaining cases we proceed by considering the numerator of 15.281 

Set d € \d, oo) arbitrarily, let n > 1 and p < n and define Tp := inf {q > p; X q € Cd, X q+ \ E Cd}- 
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We have the decomposition 



n-1 



P,Vp 



k=p 



'n-1 



+E. 



p,v P 



l[G q (X q )I r^ = k 

q=p 
1 



n-1 



r f > n 



(5.29) 



This is of exactly the same form as in equation (|5.18|) in the proof of Lemma [5J except for 
the initial measure rj p . Thus by exactly the same arguments (integrate equation (|5.24p w.r.t. 
r]p) we obtain the bound 



f]pQp,n (1) 



n-1 



< 4v P (v) ex P i~ Sd ( k ~ P) / 2 ] ex P [h ( k - P) I 2 ] "d [Ic.Qk+iA 1 )} 

k=p 

+7] p (v) exp [Sd (n - p) /2] exp [b± (n - p) /2] . 
Now set r € [d, d]. For the denominator of (|5.28p we have by (H3|), 



inf Q pn (l)(x) > inf Q p [I Cd Qp + i,n(l)]{x) 

x£C d x£C d 



also 



> e d J/ d [Ic r Qp+i,n(l)], 



n— p— 1 



and for p < A; < n, 



[Ic r Q P +i,n(l)] 



> e d E p+ i )!/d 

> e d Ep + i iiyd 



n-1 



i cr(Xp+i) n g «™ 



<?=p+l 

n-1 



icpfjH-i) II G ff (jr ff )i Cd (^ fc )i C|I (x fc+1 

<7=P+1 
fc-1 

lc r {X p+1 ) H G,pg 



<J=P+1 
k-l 



<?=p+l 

> e~i/ d (C r ) [e~i/ r (C r )] k p 1 e d [Ic d Qfc+i,n(l)] 



(5.30) 



(5.31) 



e d v d [l Cd Qk+i,n{ 1 )] 



(5.32) 



29 



Combining 1|538) . (15301) . (15~5TD and (|Q2t gives for p < n 
1 

sup I — TT 



< — Vp («) 

4 % («) 



+ 



e d e , i/ d (C r ) 



n-1 

^ exp [Sd (k - p) /2] exp [(A; - p) b d /2 + (fc - p - 1) log [e^i/ P (C r )]] 
^k=p+i 

exp [_ M ( n _ p ) /2 ] exp [(n - p) {bd/2 + log [e~u r (C r )] )] , 

with the convention that the summation is zero when p = n — 1. With r kept fixed, increasing 
d and noting that under the assumptions of the lemma, Proposition Q] holds, we conclude that 
there exists a finite constant c^{d) such that 

1 

sup sup sup - — — < c M (d). 

n>l 0<p<n x£C d hp,n\X) 

The proof is complete because d\ < =>• C C^ 2 . □ 



Proof. (Theorem\lty. The proof is based directly on those of Douc et all I2009J. Proposition 12 

and L emma 15], which are in turn developments from the decomposition ideas of lKleptsvna and Veretennikov 
|2008l | . However, there are some crucial differences here: the focus of the present work is on the 
u-norm on measures, as opposed to total variation, and different techniques will be used to deal 
with and control denominator terms in equation f|5.33[> below, by way of Propositions [1] and [2] 
Throughout the proof, c is a finite constant whose value depends on \x and the quantities in 
(HU)-(fEl) an d whose value may change on each appearance. 
Let (X n ; n > 0) be the bi-variate Markov chain on X 2 with 



X n \ {X n _i = [x n -x,x' n _i)} ~ M n (x n _i, •) ® M n {x' n _ x , •) 

and for some distribution H on X 2 we denote by E# the expectation with respect to the law of 
this bi-variate chain initialized by Xq ~ H. In line with previous definitions, for ij a distribution 
on X we write K p ^ x ®„ := f 5 X (dx) r) (da/) E# [99 (X p , . . . , X n ) | {X p = (x, x')}] . Also define 
C d '■= Cd x Cd and throughout the following writing x = (x,x r ) for a point in X 2 , define 
G n (x) := G n (x) G n (x') and v (x) := v (x) v (x') . 

For each n > 1 define the tensor-product kernel Q n (x,dy) := Q n (x,dy) (g) Q n (x' ,dy') and 
let (Q P) n) be the semigroup defined in the same fashion as (|4,ip . 

Now fix arbitrarily d G \d,oo) and define, for n > 1, 

R n (x, dy) := Q n (x, dy) - I Cd (x) (el) 5 v d ®v d {dy) 
and (R P: n) i n the same way. The dependence of .R n on d is suppressed from the notation. 
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First set n > 1 and < p < n arbitrarily. We have from the above definitions, 

Q P ,n (,<p) 0) 



Vn(f) 



hp,n{x) Uq=p \ 

Qp,n {<f) (x) VpQp,n (</>) 



Qp,n (1) (x) VpQp,n (1) 

(5 X <g> rjp) Q p , n (ip (8) 1) - (r/ p (8) <5 X ) Q P ,n (y> <8> 1) 



Qp,n (1) {x)r] p Qp,n (1) 

® ?7 P ) -Rp.n (y g 1 - 1 g </?) 



< 2 



<2 P ,n (1) (x)ri p Q p ^ (1) 

V Qp,n (1) (x)r]pQ p , n (1) 



=: 2 



Ap,n {.X, Tjp) 



Qp,n (1) (x)ri p Q P: n (1) ' 



(5.33) 



where the third equality is due to the decomposition of iKleptsvna and Veretennikovl [20081 ] . 

/ e -\ 2 

Now define p d := 1 - ( ^ ) < 1 and := % d 1^ (X fc+ i). Following es- 

V e d / r 

sentially the same argument as 
(0,1), 



Douc et al 



2009, Proof of Proposition 12] then gives, for any 



A p , n (x, rip) < E Sx(B)rlp 



'n-1 



<3=P 

[] G g pf-"I {IJJ > /? (n - p)} v {X n ) 
q=p 

n ~ 1 _ ( d ) 

[] Gq (X q ) pf^I < /? (n - p)} S (X„) 



=: Ag (x,r/ p ) + Ag {x, V p). 



.i=p 



We first consider /s!p) n (x,rj p ). As < 1, we have the bound 



^p,n (x, T/p) 



Qp,n (1) {x)ripQp, n (1) 



< 



Pd 



/9(n-p) 



Qp,n(v)(x) 

Q p>n (l)(x) 



Vn(v), 



but using Lemma [6] Proposition [TJ Proposition [2] and Lemma [TO] show that for r large enough, 
but then fixed, 



Qp,n( v )( x ) 
Qp t n(l)(x) 



< E%l[v n , n (X nin )) 



e -Sr(n-p) ^ 
< ; —— — Vp, n (x) 



x (n - p) II VL 



+ 



< c 



z r(l-8)+b r 

^p,n (x) 
1 1 hp,n 1 1 v 



n 1 _ Sr{n— k) ^ 



+ E V 



^r(Ic>p,n) .fl, A< n-fc > Vr(lc r hk,n) 

K — p-\- 1 



(5.34) 
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so 



Ap,n (x, Tip) P{n—p) v p,n i x ) / \ 

Q p , n (l)(x)r, p Q p , n (l) ~ CPd ¥v~J v W) 



(5.35) 



Douc et al. 
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\\hp,n\\ v 

where the second inequality is due to Proposition [TJ 

Now consider A p> h (x, r} p ). The main idea for treating this term is that of 
Proof of Lemma 15]. There are some cosmetic differences of indexing, but some intermediate 
steps are omitted for brevity. Define 

n-l 

M fl ■= E l Gl > °p.« : = L(" - P) (1 - /?) /2 " V2J , 



p+fc-1 



lo := u (X p ) , Ek :- 



n 



q=p /l d v H > f> c d\ q > 



Bp '■— \\\Qp&C d \\\v®vi 

v (X p+k ) , 1 < k < n - p. 



Then for 1 < k < n - p, < Efc-l, 



so that 



E 



[^n—p] — ^P,< 



i ] = v(x)r) p (v) < cv(x)n (v) , 



(5.36) 



where t he last inequality is due to Proposition Q] 



By 



Done et al 



20091 . Lemma 19], M p d n < /3 (n — p) implies Mp\n > a PjU , and then 
'p+k—l 



A q+l ^q+l 



q=p 



I 



(. 



Mj$<l3(n-p)} 

\ 2(n—p— a Pl „) 



< Slip A q 



1 V sup IHQgllk 
5>1 



2(n-p) 



1 V SUp HlQglHu 
9>1 



2(n-p) 



< SUp IHQglcsllk 

<e-**^.» ^lVsup|||Q 9 |||,J 

< exp (—5da Ptn ) exp [0 V 26^ (n — p)~\ . 



(5.37) 



where (HI]) has been used. For the remainder of the proof we may assume without loss of 
generality that > 0. 



32 



Combining (|5.36p and (|5,37p then gives 



"Hp 



n-l 



]^[ Gg (X,) I {m<J < /3 (n - p)} « (X n ) 



L<?=p 

sup -4,, ] I sup //,, ] E 

3>1 / \9>1 



< cexp [— 5da P; n + 26^ (n — p)] v(x)fi (v) , 



and therefore 



Q P ,n (1) (x)r]pQp, n (1) 



< cexp [-5da Pin + 2 (n - p) (bj - log A)] 



H{v) 



P,n\\v 



where Proposition [T] and Proposition [2] have been applied and A = inf n >o A„ > 0. 
Collecting the bounds of (I5.35p . (|5.38p and returning to (|5.33p . we establish that 



Q P ,n M (x) 



<MW\\ V 



n—l \ 
q=p A Q 

\\hp,n\\ v 



H{v) 



Mn-p) 



+ exp [-5da Pt n + 2 (n - p) (6^ - log A)] 



fl(n~p) 
Pd 



+ exp [- (n - p) (<W(1 - /3)/2 - 26^ + 2 log A) + 35d/2] 



(5.38) 



where for the second inequality, [a\ > a — 1 has been used. The proof is complete upon recalling 
that d £ [d, oo) was arbitrary, pd < 1, /3 G (0, 1) and multiplying through by h p ^ n {x). □ 



Proofs for section 14.41 

Proof. (Theorem^). Throughout the proof c denotes a finite constant whose value may change 
on each appearance but which depends only on // and the quantities in (H[T])-(fE]). Also, through- 
out the proof we take by convention that for any j < k, ^{ = 0. 
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First consider the case s > 0. By Lemma [2J 





rip=o °p ( x p) ex p 


^fee{ii,... 


,i.}l*k(*k)l)" 






(Xp) 





(dx) /l ,„ (^)IEi n) 



exp j ^ (-Xfc.n) 

fce{il,...,i s } 



' n 

vfce{ii,...,i s } 



n (^,«) 

fee{ii,...,i s } 



(5.39) 



We now obtain some bounds which will be used to control the expectation in (|5.39p . Proposition 
[1] holds under the assumptions of the theorem so we may apply the upper and lower bounds of 
Proposition [2] and Lemma [10] to the bound of Lemma [6] and choose d therein large enough, in 
order to establish that there exists a finite constant c independent ofl<p<g<n and x G X 
such that 

Eg [v q>n (X q>n )] 
< ; —— T^V P)n (x) 



\(1-p) \\hp >r 



a d(l-8)+b d 



\\hq,n\ 



V d (Ic d hq, r 



1^9," I \ v i \ 



II^P,nlL 

where A = inf n >o A n > . Therefore by (HI]), for p < q, 



(5.40) 



)f ^ { A x [v q ,n(X q , n )] < «/(z)f^ 

P,n\\ 

v Ap_i/i p ^i jn (x) 



U' )£.._, I ( ^ ... ) ••' rr \ .r ) ;;, " ,. J S p>n (v p>n ) (x) 

l P,n\\v 



cv s (x)\\h q 



Qp (v) (x) 



< c 



< c- 



v (x) e' 

Vl Vl,nWl 
W^q, n\\ v , \ 

-^f-Vp-l^X). 



\\hp~l,n\\ v 

Now fix n > 1 , 1 < s < n + 1, (ii, . . . , i s ) G X n _ s arbitrarily and define (Sfc n ; < k < s) by 

^0,n (^0,n) 



(5.41) 



— 0,n 



\\Kn\\ v 



ll^fe.nlli) 

where c is as in (|5,4ip . We then have 



exp 



fc-l 

Y,{W (X ihn )-\ogc) 

3=1 



1 < k< s. 
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E (n) 



L 1 — 'k,n\ 



< c 



fc-1 



7/ (X ifc _ 1)n ) exp 



^(^y (% ft )-iogc) 

_i=i 

fc-i 

£(*V (**,,»)- log c) 



^fc— l.rij 



where the inequality is due to ([5.4ip , Thus (Sj. n ,.Ffc,njO < < sj is a super-Martingale, with 
•Ffc.n := cr (X 0) n, . . . ,X ifc _i jn , X ife) „). Therefore 



fe€{ii,...,i s } 



,Vo,n (x) ,,, n 



< c s 



v (x) 
h ,n(x) 



(5.42) 



where Propositions [TH2] have been used for the last inequality. 

The proof is completed upon combining (|5.42|) with (|5.39p and noting that the result holds 
trivially when s = 0. □ 
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